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This article presents a seif* contained introduction to the algebraic theory of 
convolutional codes, which is partly tutorial, hut at the same time contains a number 
of new results which will prove useful for designers of advanced telecommunication 
systems. Among the new concepts introduced here are the Hilbert series for a 
convolutional code and the class of compact codes. 


I. Introduction 

Convolutional codes have played a central part in 
NASA’s deep-space telecommunications systems for many 
years. In all such applications to date, the convolutional 
codes have been codes of dimension 1 , which are com- 
monly, but not strictly correctly, referred to as “rate 1 /n” 
codes. However, as systems become more sophisticated, 
the coding subsystems must keep pace, and this article is 
an outline of algebraic theory for the most general class 
of convolutional codes known, the so-called “(n,fc,m)” 
codes, of which the usual “rate 1/n” codes form the special 
case jfc = 1. Much of this theory was originally developed 
by Forney [1 — 4], but this article adds to what is already 
known, as well as placing many of the older results into 
a modern, “system-theoretic” context. In particular, in- 
troduced here for the first time is the “Hilbert series” for 
a convolutional code, which is a generating function from 
which the dimensions of certain polynomial subcodes can 
be easily computed in terms of the “Forney indices” of the 
code. The Forney indices provide a derivation of an upper 
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bound on the free distance of a convolutional code which 
in some cases improves the bounds previously known, and 
whose derivation makes no use of the structure of any par- 
ticular encoder structure. Finally, the notion of “compact” 
and “noncompact” convolutional codes is introduced, and 
it is argued that only compact codes are likely to be inter- 
esting for applications. 


II. Convolutional Codes: Polynomial 
Generator Matrices 

Let F be a field, usually GF( 2), and let F(D) be the 
field of rational functions over F. An (n, k) convolutional 
code over F is a ifc-dimensional subspace of F(D) n . The 
elements of the code are called its codewords . A code- 
word is thus an n-tuple of rational functions over F{D). 
The weight of a codeword is defined to be the sum of the 
weights of its components, where the weight of a com- 
ponent (i.e., rational function) is the number of nonzero 
coefficients in its expansion as a Laurent series in increas- 
ing powers of D. The free distance of a convolutional code 
is defined to be the minimum nonzero weight of any code- 
word. 
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If C is an (n,fc) convolutional code over F % a generator 
matrix G(D) for C is a k x n matrix over F(D) whose rows 
form a basis for G. If the entries of G(D) are polynomials, 
then G(D) is called a polynomial generator matrix (PGM) 
for C . Any convolutional code has a polynomial generator 
matrix, since if G is an arbitrary generator matrix for G, 
the matrix obtained from G by multiplying each row by the 
least common multiple of the denominators of the entries 
in that row is a PGM for C. 

Let G(D) = (gij(D)) be a k x n PGM for C. The ith 
row of G, i.e., the n- vector (< 7 , 1 , . . . , < 7 * n ), is denoted by < 7 , , 
and the degree of < 7 , is defined as the maximum degree of 
its components. In a similar way, the degree of any n- 
tuple of polynomials is defined as the maximum degree of 
any component. The internal degree and external degree 
of G(D) are defined as follows: 

int.deg. G(D) = maximum degree of G(D) y s 
k X k minors 

ext. deg. G{D) = sum of the row degrees of G(D) 

The following two definitions will be essential in the 
discussion of convolutional codes. 

A. Definition 1 

A k x n polynomial matrix G(D) is called basic if, 
among all polynomial matrices of the form T(D)G(D) } 
where T(D) is a nonsingular k x k matrix over F(D ), it 
has the minimum possible internal degree. 

B. Definition 2 

A k x n polynomial matrix G(D) is called reduced if, 
among all matrices of the form T(D)G(D) } where T(D) 
is unimodular , 3 G(D) has the minimum possible external 
degree. Since any unimodular matrix is a product of ele- 
mentary matrices, an equivalent definition is that a matrix 


is reduced if its external degree cannot be reduced by a se- 
quence of elementary row operations. 

Before continuing along the main line, it is helpful to 
present a simple theorem that provides several useful facts 
about the internal and external degrees of a polynomial 
matrix. 

Theorem 1 . Let G(D) be a k x n polynomial matrix. 

( 1 ) If T(D) is any nonsingular k x k polynomial ma- 
trix, then int.deg. T(D)G(D) = int.deg. G(D) -f 
deg.det.T(D). In particular int.deg. T(D)G(D) 
> int.deg. G(D), with equality if and only if 
T(D) is unimodular. 

(2) int.deg. G(D) < ext. deg. G(D). 

Proof: 

(1) The kxk submatrices of T(D)G(D) are just the 
k x k submatrices of G(D), each multiplied by 
T(D). Thus the kxk minors of T(D) are just 
the kxk minors of G{D) ) each multiplied by 
det T(D). The result now follows. 

(2) Denote the degree of the ith row of G(D) by e*. 

In the expansion of any kxk minor of G(D), 
each term is the product of k entries of G(D), 
one from each row (and column). Since each 
entry from the ith row has degree < e t , it follows 
that the degree of any kxk minor is at most 
e\ -h h e* = ext. deg. G(D). □ 

Basic and reduced polynomial matrices enjoy many use- 
ful and surprising properties. The Appendix gives two the- 
orems, Theorem A-l and Theorem A-2, delineating these 
properties. These Theorems will be referenced constantly 
in the rest of this article. 

Example 1. Here are eight generator matrices for a 
(4,2) convolutional code over GF( 2). Of these eight, six, 
G 2 through G 7 , are PGMs. 


Gi 


G 2 


1 

1 + D + D 2 


1 


1 

1 + D+D 2 
D 


1 + D 2 
1 + D + D 2 

D 


f l 1 + D + D 2 1 + D 2 1-fD' 

D 1 + D + D 2 D 2 1 


1 + D \ 
1 + D + G 2 


h ) 


3 A unimodular matrix is a square polynomial matrix whose determinant is a nonzero scalar. 
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1 


C?3 = 


G4 = 


1 + D + D 2 l + £> 2 1 + D > 

0 l + D D 1 

1 D 1 + D 0\ 

0 1 + D D l) 


Theorem 2. A PGM G(D) for the convolutional code 
C is minimal if and only if it is both basic and reduced. 

Proof: First it will be shown that a minimal PGM 
must be both basic and reduced. Then it will be shown 
that a PGM that is both basic and reduced must be min- 
imal. 


G 5 = 


Ge = 


G 7 = 


f l + D 0 1 D > 

D 1 + D + D 2 d 2 1 
r l 1 1 1 \ 

t 0 l + D D 1/ 
l + D 0 1 D\ 

1 D l + D 0 / 


To prove the first assertion, denote by mo the common 
internal degree of all the basic PGMs for G, and among all 
the basic PGMs choose one, say Go(D), for which the ex- 
ternal degree is as small as possible. Then Go must be re- 
duced, since if T(D) is unimodular, int.deg. T(D)Go(D) = 
int.deg. Go(D) = mo by Theorem 1, and so by the defini- 
tion of Go, ext.deg. T(D)Go(D) > ext. deg. Go(D). Now 
let G(D) be any minimal PGM for G. Then 

int.deg. Go < int.deg. G < ext.deg. G < ext.deg. Go (1) 


Gg 



1 

TTd 

D 

l + D 


D 


\ 


l + D 
1 

i + d/ 


Table 1 lists the properties of the generator matrices 
Gi, . . . , Gg- (Note that only the polynomial generator ma- 
trices, viz., G2-G7, have external or internal degrees, or 
can be basic or reduced.) These properties are easily ver- 
ified by referring to Theorems A-l and A- 2. For example, 
G3 is basic because the gcd of its 2 x 2 minors is 1 (Theo- 
rem A-l, condition (2)), and it is not reduced because its 
internal and external degrees are unequal (Theorem A- 2, 
condition (2)). □ 

It follows from Definition 1 that among all PGMs for 
a given convolutional code, those for which the internal 
degree is as small as possible are exactly the basic PGMs. 
It turns out, however, that the set of PGMs for which the 
external degree is as small as possible form a much more 
interesting class, the class of minimal PGMs. 


(The first inequality in Definition 1 follows from the fact 
that Go is chosen to have minimum possible internal de- 
gree. The second inequality follows from Theorem 1(2). 
The third inequality is because G, as a minimal PGM, 
has minimum possible exernal degree.) But since Go(D) 
is reduced, by Theorem A-2, condition (2), int.deg. Go = 
ext.deg. Go, so that equality holds throughout Definition 1. 
Thus int.deg. G = int.deg. Go = m 0 , so G is basic; and 
int.deg. G = ext.deg. G, so that G is reduced, by Theo- 
rem A-2, condition (2). 

Conversely, suppose that G(D) is basic and reduced, 
and Go(D) is any other PGM for G. Then by Theo- 
rem 1(2), ext.deg. Gq(D) > int.deg. Go(D). Since G(D) 
is basic, int.deg. Go(D) > int.deg. G(D); since G(D) is 
reduced, by Theorem 1 int.deg. G(D) = ext.deg. G(D). 
Combining these inequalities, ext.deg. Go(D) > ext.deg. 
G(D), which proves that G{D) is minimal. □ 

In the proof of Theorem 2, mo, the common internal 
degree for all PGMs for G, is equal to degC, i.e., the 
minimum possible external degree. Thus there are two 
corollaries to Theorem 2. 


C. Definition 3 

Among all PGMs for a given convolutional code G, 
those for which the external degree is as small as possible 
are called minimal PGMs. This minimal external degree 
is called the degree of the code G, and is denoted deg G. 

It will be shown that minimal generator matrices have 
many remarkable properties. The key to these properties 
is the following theorem: 


Corollary 1. The minimal internal degree of any PGM 
for a given convolutional code G is equal to the degree of 
G. 

Corollary 2. If G is any basic generator matrix for G, 
then int.deg. G = deg G. 

The following theorem shows that minimal generator 
matrices are “minimal” in a very strong sense. 
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Theorem 3. If ei < e 2 < ■ • • < e* are the row degrees 
of a minimal generator matrix for a convolutional code C, 
and if /i < /2 < * * • < /fc are the row degrees of any other 
polynomial generator matrix, say G\ for C, then e, < /,*, 
for i = 1 , . . . , k. 

Proof: If the statement is false, there exists an in- 
dex j such that e\ < < fj , but e ; +i > fj+i- 

It then follows from the minimality of G (use the prop- 
erties in Theorem A-l, condition (5), and Theorem A-2, 
condition (3)) that the first j + 1 rows of G* must be poly- 
nomial linear combinations of the first j rows of G, which 
contradicts the fact that the rows of G* are linearly inde- 
pendent. d 

Theorem 4. The set of row degrees is the same for all 
minimal PGMs for a given code. 

Proof: This result follows immediately from Theo- 
rem 3. □ 


n-dimensional polynomial vectors of degree < L over F. 
The F-dimension of Cl is denoted by 6l . The following 
theorem shows that the bC s can be computed from the 
Forney indices. 

Theorem 5* If C is an (n,fc) convolutional code with 
Forney indices (ej, . . . , e *) and polynomial subcode dimen- 
sions 6l , then 

Note: The power series appearing in Theorem 5 is called 
the Hilbert series for the code . 

Proof: Let G(D) be a minimal PGM for C, whose row 
degrees are the ordered Forney indices, say e\ < • • * < e* , 
and let <71 , . . . , <7* be the rows of G . Let y(D) be any poly- 
nomial codeword of degree < L. Then it follows from The- 
orem A-l, condition (5), that y(D) = x(D)G(D ), where 
x(D) = (xi(D), . . . , ar*(£))) is a k - vector of polynomials, 
and it follows from the predictable degree property (Theo- 
rem A-2, condition (3)) that degx» 4* e, < L. Thus a basis 
for the F-space Cl is the set {D*gi(D) : j + e t < L}. Hence 


The row degrees referred to in Theorems 3 and 4, say 
(d, e2 , . . . , ej t), are called the Forney indices of the code. 
The sum e\ 4* f e* of the Forney indices, i.e., the min- 

imum possible external degree of any PGM for C, is the 
degree of the code. The maximum of the Forney indices is 
called the memory of the code. From now on, reserve the 
letter m to denote the degree of a given convolutional code, 
and refer to an (n,jfc) code with degree m as an (n,fc,m) 
code. An (n, k, m) code is called optimal if it has the max- 
imum possible free distance among all codes with the same 
value of n, k , and m. 


6lIL 

L> 0 


k 


= E£(‘* •+j _j_ . . .J 


k 


= EE 

1=1 j> 0 


t e '+> 
1 - t 




□ 


Example 2: Continuing the study of the (4, 2) code 
from Example 1, of the eight given generator matrices, 
only (j?6 is minimal (it satisfies condition (2) of Theorem A- 
1 and condition (2) of Theorem A-2, so it is both basic and 
reduced), so that the degree of the code C is 1, and the 
Forney indices are (0, 1). The code is thus a (4, 2, 1) code. 
From Example 3, below, it is in fact an optimal (4,2,1) 
code. □ 


Corollary 3. The following explicit formula for 6l 
holds: 

k 

Sl — ^2 max + 1 - c,- , 0) (3) 

»=i 

Proof: By elementary calculus, (1 — t)~ 2 = Ylj>oU + 
1)F . Applying this fact to Eq. (2), 


III. The Hilbert Series and Free Distance 
Bounds 

If C is a fixed (n,Jfc) convolutional code, a polynomial 
codeword of C is a codeword all of whose components are 
polynomials. Recalling that the degree of a polynomial 
vector is defined to be the maximum degree of any com- 
ponent, for any integer L > 0, Cl is defined as the set of 
polynomial codewords of degree < L. Cl is a vector space 
over F. Indeed, it is a subspace of the set of all possible 


L>0 


t Ci 




2 


= LE0' +1 )* e,+J 

i=l j> 0 

= E£o’+ 1 - c *)< i 

*=1 j>e t 
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Thus the coefficient of t L in the Hilbert series Y1l>o 
is max(L -f 1 - e t -,0), which is the desired proof. 

□ 

Corollary 4. Let C be an (n, k, m) convolutional code. 
Then for all L > 0 

6i > max((L -f l)k — ?n,0) (4) 

Furthermore, there is equality for all L > 0 in Eq. (4) if 
and only if the Forney indices assume only the two values 
[■ m/k\ and \m/k]. A code for which this is true will be 
called a compact code. 

Proof: Since max(x,0) > x, Eq. (3) implies that 6i > 

X:* = i(L + l-eO = (L+1 K'-£Li = {L+l)k-m. Since 

Sl > 0, too, it follows that <5 1 > max((L 4- l)Ar — ?n, 0) for 
all L > 0. Assuming that the Forney indices are ordered 
so that Ci < • • • < ejt, it follows from Eq. (3) that Sl = 
(L + l)k — m = max((L 4- l)k — m, 0) if L 4* 1 > ejt , and 
S L — 0 = max((L 4- l)k -m, 0) if L 4- 1 < c\ . If e* — ei < 1, 
one of these alternatives must hold for all L. On the other 
hand, if e* — e\ > 2, then there is at least one value of L for 
which e! < L + l < e*, in which case 0 < S\ < (L + l)/: — m, 
so that 6 1 / max((L 4- l)fc — m, 0). □ 

Incidentally, it is easy to show that in a compact 
code, there are exactly (m mod k) Forney indices equal to 
[m/Jk] , and k — {m mod k) Forney indices equal to [m/k\ . 
Thus for example if k = 4 and m = 13, a compact code 
will have Forney indices (3, 3, 3, 4). 

Since the subcode Ci forms an (n(L 4- linear 

block code over F, the free distance of C cannot exceed 
the minimum distance of Cl , for L = 0, 1, . . which leads 
to Theorem 6. 

Theorem 6. If C is an (n } k) convolutional code with 
Forney indices (ei, . . . , e*), then 

dfree{C) < min A 4- 

where A j p(n,ib) denotes the maximum possible minimum 
distance of an (n } k) linear block code over F. 

Note from Corollary 4 that 6l is minimized for all L, 
and so A^(n(L4- is maximized for all L, for a com- 

pact code. This suggests, but does not prove, that among 
all (n,jk,m) codes, the compact codes will have the largest 
free distances. In any case, for applications a bound on 


df ree is needed that applies to all (n.k.m) codes, regard- 
less of their Forney indices. Thus is offered the following 
corollary to Theorem 6, which gives an upper bound on 
dfree for all (n,A;,m) codes, regardless of the Forney in- 
dices. It is good to bear in mind, however, that it may be 
possible to improve the bound if the code is noncompact. 

Corollary 5. If C is an code, then 

dfreelC) < mm Af'(n(L + l),k(L 4 1) - m) 

Proof: Combine Theorem 6 with Corollary 4. □ 

The upper bound on df ree of Theorem 6 is attained for 
many, but not all, values of n, k , and m. The following 
examples will illustrate this. (All examples in this article 
are over the field GF( 2).) 

Example 3. Continuing the study of the (4, 2, 1) code 
in Examples 1 and 2, since the Forney indices are (0,1), 
by Theorem 5 the “Hilbert Series” for the code is 

,}+l 2 = 1 + 3f + 5< 2 + ■ • • + (2L + 1 )t L + ■■■ 

(I t) 

Thus the dimension of the Lth subcode Cl is 2L 4- 1. It 
then follows from Theorem 6 that the free distance of the 
code satisfies 

dfree(C) < min A GF(2 )(4 (L 4- 1), 2L 4- 1) 

In particular, for L = 0 the above bound gives df ree (C) < 
A 2 (4, 1) = 4. But in fact d iree = 4 for this code (use the 
generator matrix G4 to check this fact), so this particular 
code has the largest possible free distance for a (4,2,1) 
convolutional code, i.e., it is an optimal (4,2, 1) code. 

□ 

Example 4. Consider (n,fc,m) = (2, 1,2) codes. Since 
k = 1, there is only one Forney index, and so any (2, 1,2) 
code is compact. By Corollary 3, <5o = 0, and, for L > 1, 
d.L = L — 1, so that by Theorem 6 (here and hereafter 
VerhoeflTs tables [7] of the values of A r(n,k) are used 
when F — GF( 2)), 

d free < min(A(2, 0), A(4, 0), A(6, 1), A(8, 2), A(10, 3) . .) 

= min(oo, 00, 6, 5, 5, . . .) 

= 5 
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In fact, there is a well-known (2, 1,2) code with r/f ree = 5, 
whose (unique) minimal generator matrix is 

G(D) = (l + D‘ 2 1 + D + D 2 ) (5) 

(See [ 6 ], Chapter 9.) It follows then that the (2, 1,2) code 
defined by Theorem 6 is optimal. □ 

Example 5: Consider binary (4,3,2) codes. The For- 
ney indices of such codes must be either ( 0 , 1 , 1 ) (compact) 
or (0,0,2) (noncompact). In the first case, by Corollary 3, 
<5 0 — 1, <5i = 4, 62 = 7, etc., and so by Theorem 6 , 

d free <min(A(4,l),A(8,4),A(12,7),A(16, 10),.. .) 

= min(4, 4,4,4, . . .) 

= 4 

However, it turns out that there is no (4,3,2) code with 
df r ee = 4 [9]. The largest possible rff ree turns out to be 
df ree = 3, which is achieved by the second-order Wyner- 
Ash code [ 8 ]. A minimal PGM for such a code is 


/I 0 0 l\ 

Oil D] 

\D 0 1 1 / 

On the other hand, if the Forney indices Eire (0,0,2), then 
Corollary 3 tells one that So = 2, S\ = 4, 62 = 7, etc., and 
so by Theorem 6, 

d fre€ < min(A(4, 2), A(8, 4) , A(12, 7), A(16, 10), . . .) 

= min(2, 4, 4,4, . . .) 

= 2 

And indeed there is a (4,3,2) code with Forney indices 
(0,0,2) and df ree = 2. A minimal PGM for one such code 
is 

/ 1 0 0 1 \ 

0 10 1 

\0 0 1 1 + D 2 / 

Therefore, among (4,3,2) codes, only the compact ones 
can be optimal. □ 
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Appendix 

Basic and Reduced Matrices 


This appendix — a reference collection of many useful 
properties of basic and reduced matrices — begins with the 
basic matrices. 


• (1) (2): According to the definitions given in the 

previous paragraph , the product of the invariant factors 
of G is 


Theorem A-l. A kxn polynomial matrix G(D) is ba- 
sic (see Definition 1) if and only if any one of the following 
six conditions is satisfied: 

(1) The invariant factors of G(D) are all 1. 

(2) The gcd of the k x k minors of G(D) is 1. 

(3) G(a) has rank k for any a in the algebraic closure 
of F. 


Ai A 2 Ajt 

7l72 * ‘ ’ 7* = -T- ~Z T 

A 0 Ai A;t-i 

- 

Ao 

= A*; = (the gcd of the k x k minors of G) 


(4) G(D) has a right F[D] inverse, he., there exists an 
nxk polynomial matrix H(D) such that G(D)H (D) 

= h- 

(5) If y(D) = x(D)G(D) t and if y(D) G F[D] n } then 
x(D) G F[D) k . (“Polynomial output implies poly- 
nomial input ”) 


(6) G(D) is a submatrix of a unimodular matrix, i.e., 
there exists an (n — k) x n matrix L(D) such that 

^L(D)) ^ ias determinant 1. 


the n x n matrix 


Proof: The proof is logically rather involved. The fol- 
lowing implications will be proved: (Basic) — + ( 1) — ► (2) — ► 
(4) - (5) - (Basic); (2) - (3); (1) - (6). 

• (Basic) — i ► (1): Suppose that T is the k x n invariant- 
factor form for G, i.e., F = diag(7i,72, . ■ ■ ,7k) where 7 * — 
Aj/Ai-i, Aj being the gcd of the i x i minors of G. (Take 
Ao — 1 by convention.) Then there is a k x k unimodular 
matrix X and an n x n unimodular matrix Y such that 


XGY = r (A-l) 


(For a proof of this “invariant factor decomposition,” see 
[5], Theorem 6.3.16.) Thus if r* denotes the k x k ma- 
trix formed by the leftmost k columns of T, the matrix 
G f = r~ l XG is a polynomial matrix equivalent to G. 
Furthermore, since det(F J ^ 1 X) = detT^ 1 = (detr*)” 1 = 
(71 ■ ‘ ‘ 7Jfe ) ” 1 > unless the 7^’s are all 1, the internal degree 
of G ' is strictly less than that of T. Thus if the invariant 
factors of G are not all 1, then G is not basic, which proves 
(Basic) — ► (1). 


Hence the g^s are all 1 if and only if A* = 1. 

• (2) — ► (4): Suppose that the gcd of the k x k mi- 
nors of G(D) is 1, and denote the individual minors by 
A u(D) t for v = 1,2,..., (1). Then by Cramer’s rule 
for each i/ t there will exist a “pseudo- inverse” for G(D), 
with factor A t/ (Z)), i.e., an n x k matrix H U (D) such that 
G(D)H l/ (D ) = A i/ (D)7jf Since the gcd of the A t/ (D)’s 
is 1, there exists a polynomial linear combination of the 
A u (DYs equal to 1, say A„(D)A„(D) = 1. It follows 
that H(D) - K(D)Hv(D) is an n x k polynomial in- 
verse for G(D). 

• (4) — ► (5): Suppose that G(D) has annxfc polynomial 
inverse H(D ), and that x(D) = (x\ (D ), . . . , Xk{D)) is a k- 
vector of rational functions such that y(D) = x(D)G(D) 
is an n-vector of polynomials. Multiplying this equation 
on the right by H(D ), y(D)H(D) = x{D), which implies 
that x(D) is in fact a polynomial vector. 

• (^) (Basic): Suppose that property (5) holds, and let 
T(D) be an arbitrary nonsingular k x k matrix of rational 
functions such that G ; = TG is a polynomial matrix. Then 
by property (5), T must in fact be a polynomial matrix, 
so that by Theorem 1(1), int.deg.G / > int.deg. G, which 
means that G is basic. 

• (2) (3): Suppose that the gcd of the k x k minors of 

G(D) is 1, let a be an arbitrary element of the algebraic 
closure of F, and let p(D) be the minimal polynomial of 
a. Then there must be at least one k x k subdeterminant 
of G(D) which is not divisible by p(D), which means that 
the corresponding k x k submatrix of G(o) is nonsingular. 
Thus G(a) must have rank k. Conversely, suppose that 
the gcd of the k x k minors of G is not 1, which means 
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that it is divisible by some irreducible polynomial p(D). If 
a is a root of p(D) in some extension field of F , it follows 
that every k x k minor of G(a) is zero, which in turn means 
that G(a) has rank less than k. 

#(!)<-► (6): Suppose the invariant factors of G are all 1. 
Then the invariant-factor decomposition in Eq. (A-l) can 
be written as 


G=A(I k 0 *,„-*) B 


where A — X 1 and B — V 1 - Thus if B — 


Bu 

Br 


where By is k x n and B y is (n — k) x n, it follows that 

G = ABy. But the matrix ^ ^Bl ) * s un * mof ^ ai 5 snice 

it is obtained from the unimodular matrix B via a sequence 
of elementary row operations on the first k rows. Con- 

versely, if B = ( //(£)) ) is lmirnoc,u,ar > t,ien tlie e <l ua ' 

tion G(D) = I k (T k 0 k , n -k ) B shows that the invariant 
factors of G(D ) are all 1 . □ 


Theorem A-2. A k x n polynomial is reduced (see 
Definition 2) if and only if one of the following three con- 
ditions is satisfied: 

(1) If the “matrix of high-order coefficients” G is de- 
fined by 


Gij ~ coeff (Hj{D) 


where e^ is the degree of G(D)’s ?th row, then G 
has rank k. 

(2) ext.deg. G(D) = int.deg. G(D). 

(3) The “predictable degree property”: For any k- 

dimensional polynomial vector, i.e., any x(D) € 
F[D) k 

deg (x(D)G(D)) = max_(degXj(D) + deg <n(D)) 


Proof: The logical organization of this proof is as fol- 
lows: It shall be proved that (Reduced) — 1 - (1) — ► (2) — ► 
(Reduced), and (1) <-► (3). 

• (Reduced) — ► (1): Suppose property (1) is false. Then 

there is a nonzero ^-dimensional vector from T, say a = 


(c*i, . . . ,ajb), such that aG = 0. Now suppose that the 
rows of G are (g u . ..,£*) with deg g { = e it and e x < e 2 < 

. . . < ejk. Then from aG - 0, it follows that the coefficient 
of D €k in the linear combination 

g' k = cuD^-^gx + a^D^'^g^ + * ■ ■ + cx k D ek ~ ek g k 

is zero, so that the unimodular transformation of G(D) 
that replaces g k with g * k — and leaves the remaining rows of 
G unchanged—reduces the external degree of G. In other 
words, if property (1) is false, G is not reduced, which is 
(the contrapositive of) what the authors have set out to 
prove. 

• (1) — ► (2): Suppose that G has rank k, and denote 
the k x k submatrices of G by G„, for v = 1,2,...,(£). 
Then since rank G — k^ there is at least one index u o 
such that det G l/Q / 0. If now the row degrees of G are 
ci , ■ * * , c* , then (cf. the proof of Theorem 1(2)) the co- 
efficient of D Cl+ +efc in det G //0 is det G Uo ^ 0. Thus 
in t. deg. G > ext. deg. G. The opposite inequality is true 
for any matrix, as was shown in Theorem 1(2). 

• (2) — ♦ (Reduced): Suppose int.deg. G(T>) = ext. deg. 
G(D), and that T(D) is an arbitrary k x k unimodu- 
lar matrix. Then ext. deg. TG > int. deg. TG by The- 
orem 1(2); int. deg. TG - int.deg.G, by Theorem 1(2); 
and int.deg.G = ext. deg. T, by assumption. Combining 
this string of inequalities and equalities, ext. deg. TG > 
ext. deg. G, which proves that G is reduced. 

• (1) (3): Let x(D) = {x x (D ), . . . , ** (D)) be a fc- vector 

of polynomials, and let y(D) — (yi(D ) } . . . ,y n (D)) be de- 
fined by the equation y(D) = x(D)G{D). If the k rows of 
G{D) are denoted by gi(D ), . . . , g k (D) } then 


y(D) = x(D)G(D) 


= x l (D)g l (D) + --- + x k (D)g k (D) 


(A-2) 


If the degree of Xi(D) is di for i = 1, . . . , k, and the degree 
of g t (D) is e t for i - it follows from Eq. (A-2) 

that the degree of y(D) is at most d — maXi(d; + e*). 
Call d the “prediction” of the degree of y{D). To test the 
prediction, note that the vector of coefficients of D d in 
y(D) is a = (c*i , . . . , cx k )G } where a* is the coefficient of 
D d ~ tx in Xi(D). (At least one of the a t is nonzero,_since 
d { -hej = d must hold for at least one index i.) But aG / 0 
for all nonzero a’s if and only if G has rank k , and jso the 
prediction is true for all x(D )' s if and only if rank G = k. 

□ 

97 



Table 1. Generator-matrix properties. 


Property 

Basic? 

Reduced? 

In t. deg. 

Ext.deg. 

Gi 

_ 

_ 

_ 



G2 

No 

No 

3 

4 

Gz 

Yes 

No 

1 

3 

Gi 

Yes 

No 

1 

2 

G 5 

No 

Yes 

3 

3 

^6 

Yes 

Yes 

1 

1 

G 7 

No 

Yes 

2 

2 

Gb 

- 

- 

- 

- 
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